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We study the potential of the current Large Hadron Cohider (LHC) 7 TeV run to 
search for heavy, colored vector- like fermions, which are assumed to carry a conserved 
Z2 quantum number forcing them to be pair-produced. Each fermion is assumed to 
decay directly into a Standard Model quark and an invisible stable particle. T-odd 
quarks (T-quarks) and the lightest T-odd particle (LTP) of the Littlest Higgs model 
with T-parity provide an example of this setup. We estimate the bounds based on 
the published CMS search for events with jets and missing transverse energy in the 
35 pb~^ data set collected in the 2010 run. We find that T-quark masses below about 
450 GeV are ruled out for the LTP mass about 100 GeV. This bound is somewhat 
stronger than the published Tevatron constraint. We also estimate the reach with 
higher integrated luminosities expected in the 2011-12 run. If no deviation from the 
SM is observed, we expect that a bound on the T-quark mass of about 650 GeV, 
for the LTP mass of 300 GeV and below, can be achieved with 1 fb~^ of data. We 
comment on the possibility of using initial-state radiation jets to constrain the region 
with nearly-degenerate T-quark and LTP. 
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I. INTRODUCTION 



Searches for events with large missing transverse energy (MET) are a major focus of 
experiments at the Large Hadron Colhder (LHC), motivated mainly by the fact that such 
events are generically predicted in supersymmetric extensions of the Standard Model (SM). 
The CMS and ATLAS collaborations recently published the results of the first such search, 
in the jets+MET channel, using the 35 pb^^ of data collected during the 2010 run of the 
LHC with y/s = 7 TeV [H [2]. No excess over the expected SM background was observed, 
allowing to place bounds on the supersymmetric models which are already stronger than 
the previous Tevatron bounds. However, supersymmetry is not the only extension of the 
SM which predicts anomalous events with large MET. In fact, the prediction is very generic: 
all that's required of a model is that the heavy TeV-scale states carry a new conserved 
quantum number not carried by any SM particles, and that the lightest particle carrying this 
quantum number (which is automatically stable) be electrically neutral and color-singlet. 
Non-supersymmetric examples with these properties include models with universal extra 
dimensions (UED) [3J and Little Higgs with T-parity (LHT) |1H0]- As in supersymmetry, 
the TeV-scale particles in these models can be paired with SM states of the same gauge 
quantum numbers; unlike supersymmetry, these theories predict that the heavy particles 
have the same spin as their SM partners. Searches for jets+MET events put constraints on 
these theories as well. In this paper, we will reinterpret the null result of the CMS search [T] 
in terms of bounds on a simple extension of the SM containing vector-like fermion partners 
for the light quarks, and an additional massive gauge boson, a partner of the SM photon 
(or, more precisely, the hypercharge gauge boson). This extension can be thought of as 
a "simplified model" ^ corresponding to a limit of UED or LHT where all exotic particles 
other than the ones included in the model are either too heavy or too weakly coupled to be 
produced in significant numbers at the LHC (at least with current luminosity). In the LHT, 
such a situation is quite natural, and in fact this simplified version of the model has already 
been used as a basis for the Tevatron search for the T-odd quarks [71 [8] . We will thus refer 
to the heavy vector-like fermions as "T-quarks". In UED, the Kaluza-Klein (KK) partners 
tend to be approximately degenerate in mass, and thus a simple truncation to just a handful 



See www.lhcnewphysics.org for philosophy and examples of the simplified model approach. 
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of states is less likely to capture the phenomenology correctly; still, there is a limit of the 
model (which can be taken by adjusting brane-localized kinetic terms [U]) where our simple 
analysis would apply. 

Following Ref. [7j, we introduce four new vector-like (Dirac) fermions, Qi = {Ui, Di), with 
the same gauge quantum numbers as the SM left-handed quarks Qi, namely (3, 2)1/3. Here 
i = 1, 2 is the generation index. There's also a discrete Z2 symmetry, which we will call the 
T-parity, under which Qi — )■ —Qi. All SM states are invariant under this symmetry. We 
also introduce a massive vector boson B^^, which has no SM gauge charges, is odd under 
T-parity, and is coupled to fermions via 

Ant = eg' (QiYPiQi + h.c.) , (1) 

where sum over i is implicit and c is an order-one number whose exact value will play no role 
in our discussion, as long as the interaction is perturbative. We will assume that the four 
T-quarks have the same mass, M. This is motivated by constraints from flavor-changing 
neutral currents in the LHT model ^Q\, as well as desire for maximal simplicity. We will 
further assume that M{B) < M, so that the B is the lightest T-odd particle (LTP), and is 
therefore stable. As discussed in [7], this set of particles and mass hierarchies (with a few 
additional states above M which will play no role in this analysis) arises naturally in the 
LHT model; moreover, B can play the role of dark matter candidate [IT], further motivating 
this spectrum. 

At the LHC, the dominant production mechanism in this model is pair-production of 
T-quarks via strong interactions: 

qq U,Ui, gg UiUi, (2) 

and same for Di. The produced T-quarks decay promptly via the interaction in Eq. e.g. 

Ui ^ u,B, (3) 

producing two high-p^ jets and MET carried by the pair of -B's. Since no other decays are 
possible, the branching ratio of the decay ^ is one throughout the parameter space, inde- 
pendent of c. The phenomenology of the model is completely described by two parameters, 
M and Mltp = M{B). The main goal of this paper is to estimate the current and near- 
future reach of the LHC search for jets-|-MET in this parameter space, taking the published 
CMS search pLj as the benchmark analysis. 



4 

II. SIMULATIONS, ANALYSIS AND MSSM VALIDATION 

We implemented the model described above in MadGraph/MadEvent [12] . The process ^ 
was simulated using MadEvent, and the decays ^ of the T-quarks were simulated using 
BRIDGE [13]. (We checked, for a few representative points in the model parameter space, 
that the results are identical to simulating the production and decay together as a 2 — )■ 4 
process in MadEvent, but this approach is less computationally efficient and was not used in 
our scan over the model parameters.) The CTEQ6M PDF set [H] was used. Simulations 
were performed for a single T-quark flavor. We ignored the electroweak contribution to the 
production process, which is suppressed by a factor of order a^/a^ ~ O-O^ with respect to the 
leading QCD contribution. In this approximation, production cross section and kinematic 
distributions are identical for the 4 T-quark flavors. The parton-level events created by 
MadEvent were passed on to Pythia [15] for hadronization and showering. A fast detector 
simulation was then performed using the PGS (Pretty Good Simulation) package [16], with 
the parameters set to approximate the CMS detector.^ The resulting event sample was then 
subjected to the following set of cuts, which were chosen to approximate those used by the 
CMS study IJJ as closely as possible: 

1. Lepton veto: An event is rejected if a lepton (including an identified r) is present, 
with px > 10 GeV, \ri\ < 3, and separated by AR > 0.4 from every other object in 
the event (not including MET). 

2. Jet acceptance: Jets with px < 50 GeV or |?7| > 3 are deleted. If the resulting event 
has < 2 jets, it is discarded. 

3. Jet selection: An event is accepted if and only if it has at least two jets with px > 100 
GeV, and the leading (highest-p-r) jet is at \ri\ < 2.5. 

4. Ht cut: An event is rejected if Ht < 350 GeV, where Ht is defined as the scalar sum 
of pt's of all jets in the event. 

5. ax cut: For a 2-jet event, we define = Et2/Mt, where Et2 is the energy of the least 
energetic of the jets, and My is the transverse mass of the dijet system [IT]. For an 



^ We used the file pgs_card_CMS . dat included in the MadGraph/MadEvent package to set the PGS parame- 
ters. 
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Total Efficiency 


Signature Efficiency 


channel 


CMS 


PS 


CMS 


PS 


qq 


16.0 ±0.1 


18.8 


22.2 ±0.4 


21.7 


qg 


14.4 ±0.1 


20.3 


23.0 ±0.5 


23.9 


99 


12.0 ±0.4 


18.5 


22.5 ±2.0 


22.6 



TABLE I: Signal efficiencies, in %, for the cMSSM, LMl point. "CMS" denotes values reported 
by the CMS collaboration [Ij; "PS" refers to the analysis described in this section. 

event with > 3 jets, we first combine the jets into 2 pseudo-jets, choosing the partition 
which minimizes the Et difference between the two pseudo-jets. We then define ar in 
the same way as before, with pseudo-jets replacing the jets. In either case, the event 
is rejected if ax < 0.55. 

We simulated 10000 events for each of 391 points in the LHT parameter space, spanning 
the ranges M = 220 .. . 700 GeV, Mltp = 20 ... M - 20 GeV, with most of this area 
covered with a (20 GeV)x(20 GeV) grid. (We also explored additional points in the region 
of quasi-degenerate T-quark and LTP, see below.) For each point, we obtain the total 
cross section for production of the 4 flavors of T-quarks (which of course only depends on 
M), and the combined efficiency of the cuts. We then apply a K-factor to set the total 
cross section to its NLO value, with renormalization scale set to M [TB]. (We used the 
simple parametrization of the NLO cross section given in the Appendix B of Ref. [T9].) 
Multiplying these numbers yields the total signal cross section after cuts, which can then be 
compared with present or extrapolated bounds inferred from the CMS analysis. We estimate 
that the combined renormalization scale and PDF uncertainty on the total cross section to 
be about 20% (see, for example. Fig. 2 (c) in Ref. pL9j). By comparing cut efficiencies 
computed from simulations with fixed renormalization and factorization scales (which we 
use for our central-value estimates) to simulations with renormalization and factorization 
scales adjusted on event-by-event basis according to jet prs, for a few representative points 
in the model parameter space, we estimate the Monte Carlo uncertainty in the efficiency 
calculation to be of order 10%. Combining the two, we conservatively assign an uncertainty 
of 25% to the theoretical prediction of the signal cross section after cuts. 
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To check that this simple procedure yields sensible results, we validated it by comparing 
the cut efficiencies for the supersymmetric signal at the point LMl in the constrained minimal 
supersymmetric model (cMSSM), estimated using exactly the same procedure we use for 
T-quarks, with the corresponding efficiencies presented by CMS [1]. The results of this 
comparison are shown in Table |T) Here, "total" efficiencies denote the number of events 
passing all cuts normalized to the total number of events, while "signature" efficiencies are 
normalized to the number of events after the lepton veto. It is clear that our procedure 
gives excellent match to the CMS results for the signature efficiencies, which are within 
2 standard deviations for all 3 production channels. Our procedure does not work nearly 
as well for the total efficiencies. This is most likely due to our extremely simple-minded 
treatment of r's, which are very common in the LMl events since staus are light at that 
cMSSM point. Since the model we're studying in this paper does not predict any events with 
r's (or indeed any SM leptons) at the parton level, we do not expect this issue to affect our 
results. Indeed, the lepton veto only removes 1-3% of events in our T-quark event samples, 
so that even a factor of 2 error in the efficiency of this cut would only have a marginal 
effect on our results. On the other hand, the excellent agreement of our procedure with 
the CMS numbers at the level of signature efficiencies indicates that the procedure works 
very well when purely hadronic+MET events are considered. Since the vast majority of the 
T-quark events are of this kind, we expect our efficiencies for T-quark searches to be valid 
to a good approximation. (Of course, ideal agreement is not expected, since our modeling 
of the detector is quite crude.) 



III. CURRENT AND PROJECTED EXPERIMENTAL REACH 

The sensitivity of an experiment to the new physics signal described above is limited by 
the SM backgrounds. Dominant SM sources of multijet-|-MET events are pure QCD events 
with mis-measured jet momenta, Z+jets events with Z — z/z/, I^+jets events with W ^ Iv 
and i either not detected or misidentified, and ti events with one of the tops decaying 
leptonically. The CMS collaboration estimated the rate of SM background events passing 
the analysis cuts, by a combination of Monte Carlo simulations and data-based techniques. 
For example, an inclusive background estimate for the 35 pb~^ data set considered in [1] is 

Bg = 9.4^^0 (stat) ± 1.0 (syst). (4) 
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This number was obtained by measuring the rate of multijet+MET events in the "control 
region", i.e. the region with lower Hx than required in the analysis {Ht G [250,350] GeV), 
and extrapolating to higher Ht- Alternative estimates for electroweak and ti processes were 
obtained by measuring related processes: for example, the Z+jets rate was estimated using 
the 7+jets sample. These estimates agree, within errors, with the inclusive estimate quoted 
above. 

The CMS collaboration observed 13 events passing the selection cuts in the data, consis- 
tent with the estimate Q. This measurement can be used to place a limit on the new physics 
contribution to the rate. A systematic derivation of such a limit should take into account 
the possibility of contamination of the control region by new physics. The procedure for 
doing so is rather complicated, and the result is model-dependent. On the other hand, given 
the shape of the SM Ht distribution and the existing constraints on new physics models 
under consideration, the contamination is expected to be a minor effect, and at the level of 
phenomenological analysis of this paper, it is reasonable to ignore it. In this approximation, 
a new physics contribution after all cuts is limited to at most 13.4 events at 95% confidence 
level (CL) [1], corresponding to a cross section limit of 0.383 pb. 

The impact of this cross section limit on the parameter space of our model is shown 
in Fig. [1} The region to the left and below the solid black line is excluded. Notice that 
the bound is already stronger than the only published Tevatron bound on this model by 
the D0 collaboration [S], shown as the lightly shaded region on the figure. (To be fair, it 
should be noted that the D0 analysis was based on a 2.5 fb~^ of the Tevatron data and 
has not been updated.) The LHC search sensitivity can be rapidly improved with larger 
data sets, expected to be collected during the 2011-12 run. As an example, we present 
an estimate of the reach expected with 1 fb~^ of integrated luminosity (solid red line in 
Fig. [T]). To obtain this estimate, we rescaled the background rate from (|4]), assuming that 
the fractional statistical error will scale as 1/ y/L^ while the fractional systematic error will 
remain unchanged at approximately 10 %. We further assumed that the measured rate 
will coincide with the central value of the background estimate. This procedure yields the 
expected cross section bound (after all cuts) of 74.6 fb, which was used to compute the 
reach. According to this simple extrapolation, the analysis becomes systematics-dominated 
around 1 fb~^ of integrated luminosity, and further data does not significantly improve the 
reach: the expected cross section limit improves only to about 60 fb for 10 fb~^. 
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FIG. 1: Solid black line: Estimated exclusion contour based on the published CMS analysis [T]. 
Solid red line: Estimated reach for the same analysis with 1 fb^^ of data at 7 TeV. Dashed 
black/red lines indicate the variation of the limits assuming a 25% uncertainty on the cross section 
prediction. Lightly shaded parameter region below the dash-dotted black line is excluded by the D0 
search at the Tevatron [Bj- The region below the dotted line is ruled out by precision electroweak 
constraints in the LHT model [20], but may be allowed in a more general simplified model context. 
In the darkly shaded band, the jets that pass the analysis cuts are predominantly from initial-state 
radiation. 

A somewhat surprising feature of Fig. [T] is the apparent sensitivity of the analysis in 
the region of quasi-degenerate T-quark and LTP. Naively, the cut efficiency should become 
vanishingly small in that region: The jet energy in the T-quark rest frame is proportional to 
M — Mltp, and the T-quarks are not ultrarelativistic in the lab frame (the typical velocity 
is about 0.5), so the jets should be soft and should fail the acceptance and Ht cuts. In fact, 
the cut efficiency decreases as expected for M — Mltp ~ 100 GeV, but then approximately 
flattens out (at about 1% for the typical T-quark mass in our analysis) at M — Mltp ^ 100 
GeV (see the left panel of Fig. [2]). Our interpretation is that in this region, the hard jets 
passing the acceptance and Ht cuts are due to initial-state radiation (ISR). This is further 
conflrmed by comparing parton-level and Pythia-level events for a sample parameter point in 
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FIG. 2: Left: Combined analysis cut efficiency for the T-quark signal, as a function of the T- 
quark/LTP mass difference, for four sample values of the T-quark mass. Right: pT distribution of 
the second-hardest jet in each event, at the parton level (dotted histogram) and at the level of PGS 
output (solid histogram). In this simulation, M = 400 GeV and Mltp = 380 GeV. 

the quasi-degenerate region, shown on the right panel of Fig. [2j The parton-Ievel sample has 
no jets passing acceptance cuts, while Pythia-level pt distribution has a long tail extending 
above the 100 GeV threshold required in this analysis. Of course, the treatment of ISR in 
Pythia is approximate, and large corrections are expected at large px- In our events the 
typical ISR jet passing the cut has pr ~ 100 GeV, which is hierarchically smaller than the 
typical hard scale of the T-quark production process ~ 2M ~ 1 TeV, so the Pythia 
predictions should not be grossly incorrect. Still, a more detailed treatment (with careful jet 
matching, or, ideally, with fully differential NLO cross sections in addition to resummation 
of large logs) is needed to obtain fully reliable efficiency estimates in the quasi-degenerate 
region. To emphasize this point, in Fig. [T] we shaded the region where most of the jets 
passing the analysis cuts are from ISR. 

IV. CONCLUSIONS 

In this paper, we estimated the current and near-future LHC sensitivity to "T-quarks" , 
exotic vector-like quarks carrying a Z2 charge, which decay to an "LTP" , a weakly-interacting 
massive particle invisible in the detector, and an SM quark. We used the published CMS 
search in the jets+MET channel |1] as the benchmark analysis. We found that this search, 
based on the 35 pb~^ of data at a/s = 7 TeV collected in 2010, already improves on the 
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existing Tevatron bounds. Significant future improvement can be obtained witli 1 fb~^ data 
set, expected to be collected in 2011-12. Beyond 1 fb~^, we find that the analysis becomes 
systematics-limited, assuming that the fractional systematic error on the background esti- 
mate reported by CMS (about 10%) does not improve with more data. 

While we used the terminology of the Little Higgs model with T-parity (LTP) to describe 
the new particles present in our model, the analysis is in fact quite model- independent, and 
can be applied to any new physics model whose LHC phenomenology can be approximated 
by the simple Lagrangian in g. In this sense, our study is in the spirit of the "simplified 
model" approach to describing LHC signals of new physics. In particular, our analysis 
applies (at least approximately) to the UED model, in the limit when KK partner of the 
gluon is significantly heavier than the KK quarks. 

An interesting feature that was noted is that in the region of nearly-degenerate T-quark 
and LTP, the cut efficiencies do not vanish, due to the presence of ISR jets passing the 
acceptance and analysis cuts in a non-vanishing fraction of events. This can be especially 
important in the UED context, where such near-degenerate spectrum is generic (at least in 
the absence of large brane-localized kinetic terms). A more careful analysis of ISR jets is 
needed in this region. (See Ref. |21j for similar observations in the context of supersymmetry 
searches.) 

On a more technical note, we found that, reassuringly, the Monte Carlo tools widely used 
by phenomenologists, including PGS, do a decent job of reproducing experimental efficiencies 
for fully hadronic events. However, we did not find a good agreement with CMS lepton veto 
efficiencies, most likely due to our overly simplistic treatment of r's. This was not an issue 
in our analysis, since our signal model predicted essentially no leptons, but it does indicate 
that caution is warranted in general. 

To conclude, our analysis shows that the 7 TeV LHC runs in 2010-2012 have a potential to 
either discover the T-quarks or substantially improve the existing bounds. We encourage the 
LHC experiments to perform this search. If an excess above the SM background is observed 
in this channel, we stress that alternative interpretations, such as the model studied here, 
must be considered before it is attributed to supersymmetry. (For a recent analysis of model 
discrimination power of the LHC in this channel based on jet pt and angular distributions, 
see Ref. [22].) 
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